Learning to Segment a Video to Clips Based on Scene and Camera Motion
نویسندگان
چکیده
In this paper, we present a novel learning-based algorithm for temporal segmentation of a video into clips based on both camera and scene motion, in particular, based on combinations of static vs. dynamic camera and static vs. dynamic scene. Given a video, we first perform shot boundary detection to segment the video to shots. We enforce temporal continuity by constructing a Markov Random Field (MRF) over the frames of each video shot with edges between consecutive frames and cast the segmentation problem as a frame level discrete labeling problem. Using manually labeled data we learn classifiers exploiting cues from optical flow to provide evidence for the different labels, and infer the best labeling over the frames. We show the effectiveness of the approach using user videos and full-length movies. Using sixty full-length movies spanning 50 years, we show that the proposed algorithm of grouping frames purely based on motion cues can aid computational applications such as recovering depth from a video and also reveal interesting trends in movies, which finds itself interesting novel applications in video analysis (time-stamping archive movies) and film studies.
منابع مشابه
An Improved Motion Vector Estimation Approach for Video Error Concealment Based on the Video Scene Analysis
In order to enhance the accuracy of the motion vector (MV) estimation and also reduce the error propagation issue during the estimation, in this paper, a new adaptive error concealment (EC) approach is proposed based on the information extracted from the video scene. In this regard, the motion information of the video scene around the degraded MB is first analyzed to estimate the motion type of...
متن کاملEstimation of Camera Parameters in Video Sequences with a Large Amount of Scene Motion
Most of existing techniques to estimate camera motion is based on analysis of the optical flow. However, such methods can be inaccurate and/or inefficiently when applied in video sequences which have a large amount of motion or a large number of scene changes. In this paper, we present an approach to estimate camera motion based on analysis of local invariant features. Such features are robust ...
متن کاملTraffic Scene Analysis using Hierarchical Sparse Topical Coding
Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...
متن کاملRobust Estimation of Camera Motion using Local Invariant Features
Most of existing techniques to estimate camera motion are based on analysis of the optical flow. However, the estimation of the optical flow supports only a limited amount of scene motion. In this report, we present a novel approach to estimate camera motion based on analysis of local invariant features. Such features are robust across a substantial range of affine distortion. Experiments on sy...
متن کاملRetrieve video clips using the global motion information
Retrieve video clips using the global motion information Tianli Yu and Yujin Zhang Department of Electronic Engineering, Tsinghua University, Beijing 100084, China In this paper, a new scheme is proposed to extract the global motion model from general video sequence. Since global camera motion often conveys the semantic meaning of the video’s content, these global model parameters are used as f...
متن کامل